HITTING ALL MAXIMAL INDEPENDENT SETS OF A BIPARTITE 

GRAPH 

JEAN CARDINAL AND GWENAEL JORET 



Abstract. We prove that given a bipartite graph G with vertex set V and an integer 
k, deciding whether there exists a subset of V of size at most k hitting all maximal 
independent sets of G is complete for the class £2 . 

1. Introduction 

A maximal independent set in a graph is a subset of pairwise nonadjacent vertices that is 
maximal with respect to inclusion. We consider the following problem: given a graph G 
with vertex set V and an integer k, does there exist a subset of V of size at most k that 
intersects every maximal independent set of G? While this problem is well known to be 
NP-hard, it is not known to belong to NP, hence not known to be NP-complete. We prove 
that this is unlikely to be the case: the problem is complete for the class = NP NP , at 
the second level of the polynomial hierarchy. Furthermore, our proof holds even in the 
case where the input graph is bipartite. 

1.1. Previous works. We distinguish three lines of work that are closely related to 
our result. They concern respectively the clique transversal problem, fibres in partially 
ordered sets, and the clique coloring problem. 

1.1.1. Clique transversals. A clique is a subset of pairwise adjacent vertices. The problem 
we consider is often formulated in the complement graph, where we wish to hit every 
maximal clique with a subset of the vertices. Such a subset is called a clique transversal, 
and the corresponding clique transversal problem consists of deciding whether there exists 
a clique transversal of size at most k. 

Early investigations can be found in Aigner and Andreae [1], Tuza [19], and Erdos, 
Gallai, and Tuza [10]. The latter contains the first NP-hardness proof for the problem. 
The authors also showed that there are graphs on n vertices where the minimum size of 
a clique transversal is as large as n — o(n). 

Balachandran, Nagavamsi, and Rangan [5] observed that the minimum clique transversal 
problem was solvable in polynomial time on comparability graphs, a consequence of 
Menger's theorem. Clique transversals of line graphs and chordal graphs have been 
considered by Andreae, Schughart, and Tuza [3], and Andreae [2], respectively. 
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Later, Guruswami and Pandu Rangan [12] proved the NP-hardness of the clique transver- 
sal problem in a number of restricted cases, including planar graphs and line graphs. They 
also proved it remains NP-hard on complements of bipartite graphs, which corresponds 
exactly to our problem, and provided polynomial-time algorithms for some other cases, 
including strongly chordal graphs and Helly circular-arc graphs. 

Duran, Lin, and Szwarcfiter [9] studied clique-perfect graphs, defined as graphs for which 
the minimum size of a clique transversal is equal to the maximum number of pairwise 
disjoint maximal cliques. 

More recently, Shan, Liang, and Cheng [18] gave improved bounds on the size of a 
minimum clique transversal in cubic graphs. Duran, Lin, Mera, and Szwarcfiter [8] gave 
a more efficient algorithm for finding a minimum clique transversal of a Helly circular 
arc graph. The reader is referred to those two papers for more references. 



1.1.2. Fibres in posets. The notion of fibre of a poset is closely related to the clique 
transversal problem. Given a partially ordered set P = (X, ^), a fibre is a subset of X 
that intersects every maximal antichain of P, that is, every maximal subset of pairwise 
incomparable elements. Hence fibres are clique transversals of the complement of the 
comparability graph of the poset. 

Aigner and Andreae [1] asked the question of whether there always existed a fibre of 
size at most half the number of elements, provided the poset has no splitting element. 
Lone and Rival [14] further conjectured that the elements of any finite poset without 
splitting element could be partitioned into two fibres. This was answered in the negative 
by Duffus, Sands, Sauer, and Woodrow [7]. Duffus, Kierstead, and Trotter [6] proved, 
however, that every poset without splitting element has a fibre of size at most 2/3 its 
number of elements. They also provided a coNP-completeness proof for the verification 
problem, consisting of deciding whether a given subset is a fibre. This result holds even 
if the poset has height two, which again corresponds exactly to our setting, since every 
bipartite graph is the comparability graph of a poset of height two. Finally, they proved 
that the problem of deciding whether there exists a fibre of a given size is NP-hard. 

Our proof borrows ideas from the complexity results of Duffus et al. [6]. Our result can 
actually be reformulated as follows: given a poset P and an integer k, deciding whether 
P has a fibre of size at most k is -complete, even if P has height two. 



1.1.3. Clique coloring. Marx [15] recently published a proof that the following problem 
is complete for S^: Given a graph, is it possible to color its vertices with two colors so 
that no maximal clique is monochromatic? Hence we wish to decide whether there exists 
a partition of the vertices into two clique transversals. Our hardness proof uses the same 
general structure as Marx's reduction. 

Other contributions on the clique coloring problem include for instance Kratochvfl and 
Tusza [13] and Bacso, Gravier, Gyarfas, Preissmann, and Sebo [4]. 
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1.2. Plan. In the next section, we briefly review different definitions of the class £ P and 
give examples of problems that are known to be X P -complete. In Section 3, we give the 
proof of our main result. 

2. The Polynomial Hierarchy and the class £ p 

We briefly review standard material on the complexity classes defining the polynomial 
hierarchy. The reader is referred to the textbook of Papadimitriou [16] for a more exten- 
sive treatment, and to the compilation of Schaefer and Umans [17] for more examples of 
problems that are complete for some classes of the polynomial hierarchy. 

An oracle for a language L can be thought of as a subroutine that checks whether a 
given word belongs to L in constant time. When a Turing machine is given access to an 
oracle for some language that does not belong to P, the class of problems it can solve 
in polynomial time is possibly increased. The so-called polynomial hierarchy (PH) is 
a hierarchy of complexity classes that involves oracles for languages in NP. Note that 
in that case it does not matter which particular language L is involved as long as it is 
NP-complete. Also note that oracles for NP and coNP yield the same computational 
power. 

The class E P we will be interested in is also denoted NP NP , where the exponent means 
that the machine involved has access to an NP oracle. Hence this is the class of languages, 
or problems, that can be decided in polynomial time on a nondeterministic Turing ma- 
chine with access to an NP oracle. Similarly, the class is defined as coNP NP , and 
contains exactly the complements of the languages in S P . 

In general, PH is the union of A p = S p = n p = P and the following classes defined 
recursively for i ^ 1 : A p = P E £-i, S p = NP 5 ^- 1 , and nf = coNP 1 ^- 1 . So for instance 
S p =NP, n p =coNP, and n P = coNP NP . 

An alternative definition of the classes Sf and Uf makes use of canonical problems 
generalizing the NP-complete satisfiability (SAT) problem. Recall that the SAT prob- 
lem consists of deciding whether 3xf(x) holds, where f(x) is a boolean formula de- 
fined on the variables x = (x\,X2, ■ ■ ■ ,x n ). It can be assumed without loss of gener- 
ality that / is given in 3-conjunctive normal form (3-CNF), yielding the well-known 
NP-complete 3SAT problem. It can be shown that the problem of deciding whether 
3x 1 Vx 2 3x 3 . . . QiX l f(x 1 , x 2 , . . . , x l ), where / is a boolean formula, the x J are disjoint 
sets of boolean variables, and the existential and universal quantifiers alternate, is com- 
plete for E p (the quantifier Qi for x l is universal if i is even, and existential otherwise). 
Similarly, the problem Vx 1 3rc 2 Vx 3 . . . Q; L x % f(x l , x 2 , . . . , x l ) is complete for Ilf (here Q - L is 
universal if i is odd, and existential otherwise). 

We will use the corresponding problem for S P where / is a disjunction of terms of size 
three, or in other words in 3-disjunctive normal form (3-DNF). A Q-3-DNF formula is a 
formula ip = (f(x, y) with x = (x\, . . . , x n ) and y = (yi, ■ ■ ■ , y m ) where the Xj's and y^'s 
are distinct variables, which is of the form 



3x Vy (t M A ii, 2 A ti >3 ) V (i 2 ,i A t 2)2 A t 2)3 ) V • • • V {t qA A t q , 2 A t, j3 ) 
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where, for each I G {1,2, ... ,q}, the term (tg i A tf2 A ^3) consists of three literals 
corresponding to variables in {x±, . . . , x n } U {y\, . . . , y m }- Deciding whether ip holds true 
consists of checking whether there exists a boolean vector x such that for any boolean 
vector y, there is at least one term (t^i A A ^3) that evaluates to true. 

Lemma 2.1. It is £3 '-complete to decide whether a Q-3-DNF formula (p holds true. 

3. Proof of the Main Result 

We first show that a restricted version of the canonical problem for £3 remains complete 
for £3 . We then reduce from this problem in our hardness proof. We use the notation 
[n] := {1,2,..., n}, where n is any natural number. 

3.1. Nice and Monotone Formulas. Let ip(x,y) be a Q-3-DNF formula, with x = 
(xi, . . .,x n ), y = (yi, . . . ,y m ), and comprising q terms of the form (te t i A t^ 2 A ^3), 
i G [q]. The formula <p is said to be monotone if the three literals ^1,^2^3 are either 
all positive or all negative for each i G [q]; in this case a term of <p is said to be positive 
(negative) if its three literals are positive (negative, respectively). If moreover <p is such 
that, for each i £ [n], there exist a positive term and a negative term avoiding the literals 
Xi and Xi, respectively, then ip is said to be nice. (Note that such formulas have at least 
one positive term and one negative term.) 

Lemma 3.1. Given a nice monotone Q-3-DNF formula tp, it is ^-complete to decide 
whether p holds true or not. 

Proof. The problem clearly belongs to £2 , since it is a special case of the problem of 
deciding whether a Q-3-DNF formula holds true, which from Lemma 2.1 is complete for 
£2- For the hardness part, we reduce from the latter problem. Let thus ip = if)(x,y) be 
a given Q-3-DNF formula. 

We first transform -0 into a formula containing only monotone terms. For this, we can 
apply the same transformation as the one proposed by Gold [11] for proving NP-hardness 
of the monotone SAT problem. For each non-monotone term in tp, we add an extra 
variable in the group of variables that are quantified universally, and split the term into 
two new terms. Consider such a term and suppose without loss of generality that it has 
the form (a A b A c). (The case where there are two negative literals is symmetric.) Then 
we can split it into two terms as follows 

(a A b A c) o Vd (a A b A d) V (d A c). 

Doing this for every non-monotone term yields a monotone formula that holds true if 
and only if tp holds true. 

Now assume ip = ip(x,y) is formula of the above form; that is, the formula is monotone 
and each term has size two or three. We now make the formula nice. For each literal Xi 
such that there is no negative term avoiding Xj, include a new dummy term of the form 
(xi A z A z) , where z is a new variable that is quantified universally. This obviously does 
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not alter the validity of the formula. This term can in turn be split into two monotone 
terms using a second additional variable w: 

(%i A z A z) o \/w (xi A z A w) V (z A w). 

Those two new terms are monotone, and the second term is negative and does not contain 
Xi, as wanted. Apply the symmetric operation each literal literal x% such that there is no 
positive term avoiding X{. This ensures that the resulting formula is nice. 

Finally, each term (a A b) composed of two literals can be transformed into a term 
(a A b A c) of size three, where c is a new variable quantified existentially, so that the 
resulting formula is simultaneously monotone, nice, and Q-3-DNF. □ 

3.2. Main Result. For simplicity, a vertex-subset of a graph G meeting all maximal 
independent sets of G will be called a transversal of G. 

Theorem 3.2. Given a bipartite graph G and a positive integer k, it is Y^- complete to 
decide whether G has a transversal of size at most k. 

Proof. First, we notice that the problem belongs to £f . This poses no difficulty, as it 
is known that deciding whether a given subset is a transversal is coNP-complete, even if 
the graph is bipartite [6] . Hence the whole problem can be solved on a non-deterministic 
Turing machine with access to an oracle for deciding an NP-complete language. This is 
exactly the definition of ■ 

For the hardness part, we reduce from the problem of deciding whether a given nice 
monotone Q-3-DNF formula holds true or not, which is -complete by Lemma 3.1. Let 
<p = <p(x, y) be such a formula, with x = [x\, . . . , x n ), y = (yi, . . . , y m ), positive terms 
(*i,i A ti,2 A ti )3 ), . . . , (t q ,i A t q>2 A t q> 3), and negative terms A t' 12 A t' 13 ), (t' ql l A 

We construct a bipartite graph G from ip as follows. The vertex set of G is composed of: 

• vertices a^, bi, Xi, a~i, hi, x~i for each variable Xi (i 6 [n]); 

• vertices yj and y~j for each variable yj (j £ [m]); 

• vertices tg, rg, sg for each positive term [tg^i A tgp A tg^) (i G [q]), and 

• vertices t' e ,r' £ , s' £ for each negative term (t f e ± A t' e 2 A ^ 3 ) (£ € [q 1 ]). 

The bipartition of G we will consider is (P, N) with 

P = {a,, 6j, Xi : i £ [n]} U {yi, . . . , j/ m } U {^, rg : £ £ [q]} 
U{s' e :£e [q']} 

and := V(G) — P. (P stands for 'positive' and ./V for 'negative'.) The edges of G are 
determined as follows: 

• for each variable Xi, add the edge XiXf, 

• for each variable yj, add the edge yjyf, 

• for each i £ [n], link cij and bi to each vertex in TV — {xi,t>i} and iV — {ai,bi}, 
respectively, and a, and 6j to each vertex in P — {xi, bi} and P — {aj, respec- 
tively; 
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(a) Gadget for the vari- 
able Xi . The four maximal 
independent sets shown 
remain maximal indepen- 
dent sets in G. 




J i\ • i, «2 ^i?> 

(b) Gadget for the £th positive term (x ix A x i2 A x i3 ). 
The maximal independent set {rt,si,tt\ remains a 
maximal independent set in G, as the vertices re and 
se are adjacent to all other vertices in the opposite side 
of the bipartition. 



Figure 1. Gadgets for the reduction. 

• for each positive term (t^i A tip A ^,3), add the 3 edges tix^ , tgXi 2 , t£Xi 3 where 
x i d = ti,d f° r each d G {1, 2, 3}; 

• for each negative term (t' e 1 A t' i2 A t' e3 ), add the 3 edges t'^x^ , t'(Xi 2 , t'gXi 3 where 
Xi d = t' ed for each d G {1, 2, 3}; 

• for each I G [q], link 77 and sg to each vertex in N — {si} and P — {t£,ri}, 
respectively, and 

• for each £ G [q'], link r' e and s' £ to each vertex in P — {s' £ } and N — {t' £ ,r' e }, 
respectively. 

Let us consider the maximal independent sets of G. Say that such a set is regular if it 
avoids the four vertices a,i,bi,di, b% for each i G [n], the two vertices ri, S£ for each t G [g], 
and the two vertices for each t G [(?']. It will be helpful to have a characterization 
of the maximal independent sets of G that are not regular. 

Let thus S be such a set. First suppose that S n {cij, 6j, <Xj, 6j} 7^ for some i G [n]. 
Recall that and 6« are adjacent to all vertices in N, except, respectively to x~i,bi, and 
di,bi, and symmetrically for at and bi. Hence every maximal independent set containing, 
say, a,i and bi cannot be extended with vertices outside the variable gadget defined by 
the vertices ai,bi,Xi,di,bi,Xi (see figure 1(a)). Therefore, upon inspection of G, one can 
check that either 

• S is equal to P or N, or 

• S C {ai,bi,ai,bi,Xi,x~i} in which case S is one of the four sets 
{<H, bi, hi}, {en, bi,bi}, {a,i, bi,Xi}, {a.i, bi, xj, or 

• S D {di, bi, Hi, bi} = {di} in which case S is the union of {di,Xi} U 
{xi, . . . Xi-i, Xi + i, . . . , x n } U {yi, . . . ,y m } =: Si with the set 2j of all vertices 
t(_ {I G [q]) such that the literal Xi does not appear in the l-th positive term of if, 
or 

• S D {cij, 6j, Oj, 6j} = {aj} in which case S is the union of {cii,Xi} U 
{aci, . . . Xi-i, acj.fi, . . . , i n } U {yi, . . . , y m } =: with the set T/ of all vertices 
^ (■£ G [(/]) such that the literal Xi does not appear in the £-th negative term of 
p. 
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A crucial observation is that the sets Tj and T- defined above are non-empty for each 
i G [n]; indeed, this exactly corresponds to the requirement that the formula (p be nice. 

Now assume S n {cij, bi, Hi, hi} = for each i G [n]. Since S is not regular, we have 
S Pi {rg, sg} / for some £ G [g], or S 1 n {r^, s^} / for some £ G [</]. In the first case 

• either S 1 = {tg, rg, sg}, or 

• SP\{tg, rg, sg} = {tg, sg}, in which case S is the union of {tg, s^U-f^, . . . , t' q ,} =: 
with the set Zg of all vertices in {x\, . . . , x n } U {y\, . . . , 7/m} to which tg is not 
adjacent (that is, all negative literals that do not contradict the £-th positive 
term) . 

In the second case the situation is symmetric as expected; that is, 

• either S = {t' £ , r'g, s^}, or 

• S is the union of {t' e ,s' e } U {t\, . . . ,t q } =: W[ with the set Zg of all vertices in 
{xi, . . . , x n } U {y±, . . . , y m } to which t' e is not adjacent. 

Summarizing the above discussion, the maximal independent sets of G can be classified 
as follows: 

(1) regular maximal independent sets of G; 



(4) SiUTi for ie[n]; 

(5) S[\JTl for i € [n]; 

(6) {^,77,s^} for £ G [g]; 

(7) WgUZg for£€ [g]; 

(8) {t' e ,r' e ,s' e } for i£ [q'], and 

(9) Wj,UZ' e for ^ G [g 7 ]. 



Let k := 2n + q + (/'. We are now ready to show that the formula (p is true if and only if 
G has a transversal of size at most k. 

(=>) First suppose that (p holds true. We have to show that there exists a transversal 
X C V(G) of G of size at most k. We refer to the satisfying truth assignment for x as 
fi : {xi, X2, • • • , x„} i->- {True, False}. We define X as follows: 

X := : £ G [g]} U : ^ G [q'}} U {ftj, x 4 : /i(x;) = True} U {^, Xi : m(^) = False}. 

The set X has size exactly fc. We now have to check that X is actually a transversal. 
Since all vertices tg and £^ belong to X, the sets P and N, and every set of type 6 and 8 
above are hit. For the same reason, the independent sets of type 7 and 9 are hit as well. 
Since the sets of the form Tj and T[ are nonempty (see remark above), the independent 
sets of type 4 and 5 are also hit by the vertices tg and t' £ of X. And since X contains 
either {bi,x~i} or {t>i,Xi} for every i G [n], the independent sets of type 3 are all hit. 

Thus it only remains to check whether the regular maximal independent sets are hit by 
X. Suppose this is not the case and let U C V(G) be a regular maximal independent set 



(2) 
(3) 



P and N; 

{ai,bi,bi},{ai,bi,bi},{ai,bi,Xi},{ai,bi,Xi} for i G [n]; 
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that does not intersect X. In particular, U does not contain any vertex of the form t£ or 
t'g. The set U n {x\, x±, X2, X2, ■ ■ ■ , x n , x n }, by definition, contains exactly the literals that 
are set to True by the assignment fj,. Also, the intersection U n {yi, yi, y2, y~2, ■ ■ ■ , Vm, ym} 
can be interpreted as a truth assignment for y, as exactly one each pair of yj, y~j (j £ [m]) 
must be contained in U. In this truth assignment for y, we set yj (respectively, y~j) to 
True whenever it is contained in U. But now since this combined truth assignment is a 
truth assignment satisfying <p, there must exist a satisfied term. Let t denote the vertex 
corresponding to this term; that is, t = te if it is the £th positive term, t = t' e if it is the 
£th negative one. The vertex t is adjacent only to the three literals that contradict it. 
Since the term is satisfied, t is not adjacent to any vertex of U. But then U U {t} is again 
an independent set, contradicting the maximality of U. Hence such a U cannot exist, 
and X meets every maximal independent set of G. 

(^=) Now assume that G has a transversal X of size at most k. In order to deduce a 
satisfying truth assignment for x, we first need to transform X into some canonical form. 

Claim 3.3. If G has a transversal of size at most k, then it also has a transversal X' of 
size exactly k, with the following properties: 

• {h,...,t q }cx' ; 

• {t[,.. .,t' q ,} C X', and 

• for each i G [n], either {6j,x«} C X' or {bi,Xi} C X' . 

Proof. First we observe that the k maximal independent sets {ai,bi,Xi} (i £ [n]), 
{ai,bi,Xi} (i G [n]), {te,re,sg} (£ £ [q]), and {t' £ ,r' e , s' e } (£ G [q 1 ]) are all pairwise dis- 
joint. Thus \X\ = k, and X meets each of them in exactly one vertex. Moreover, 
\X n {xi, Xi}\ ^ 1 for each i E [n], since X would otherwise be disjoint from the maximal 
independent set {aj,6j,&j}. Let Qi := {bi,Xi} if Xi £ X, and let Qi := {bi,x~i} otherwise. 
(Let us recall that possibly X contains neither of Let 

n 

X' :={t 1 ,...,t q }U{t' 1 ,...,t' q ,}u(jQ i . 

i=i 

By definition X' is in canonical form and \X'\ = k. Thus it remains to show that X' is 
indeed a transversal. Using that X is a transversal and that 

• {t 1 ,...,t q }U{t' 1 ,...,t' ql }C\X'- 

• X n {xi, . . . ,x n } C X' n {xi, . . . , x n }; 

• X n {xi, . . . , x n } C X' n {xi, . . . , x n }, and 

• Xn ({yi,...,y m }U{yi,...,y m }) = 0, 

it directly follows that X' meets each regular maximal independent set of G. Hence it 
remains to consider the non-regular ones. Clearly X' intersects those of type 2, 3, 6, 7, 
8, and 9. (Here we use that q ^ 1 and q' ^ 1.) As for types 4 and 5, the sets Si U Tj and 
S'i U T[ have also a non-empty intersection with X' for each i £ [n], since Tj / and 
T[ 7^ by the properties of (p. Therefore X' is a transversal, as desired. □ 

So we now have a canonical transversal X', and we wish to show that (p holds. We 
construct a truth assignment fi : {x\, X2, ■ ■ ■ , x n } i-> {True, False} by letting /Lt(xj) to 
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True if and only if Xj X' . We need to show that this value for x is a witness that ip 
holds true. 

Consider a truth assignment // for x, y that coincides with |ioni, We can assume that 
this assignment is not trivial in the sense that not every variable has the same truth value. 
Otherwise, since there exist at least one positive and one negative term, the formula is 
trivially satisfied. Furthermore, since the formula is nice, it is also trivially satisfied by 
the assignment where only one negated literal xi holds true, and all others variables are 
set to True, or, symmetrically where Xi is set to True and all other variables to False. 
Hence we may assume that the considered assignment is not of this form either. 

Consider a maximal independent set U containing exactly the literals set to True by 
fi'. With the above assumptions, the maximal independent set U cannot be equal to P 
or N, as they correspond to trivial truth assignment, where everything is set to True 
(respectively, to False). Since U contains one literal for each variable, it clearly cannot 
be of type 3, 6, or 8. The set U cannot be of the form Si U Tj,i € [n] (type 4) either, 
as this case occurs when only one negated literal x~i holds true, and all others variables 
are set to True. Similarly, we can discard the case where U is of the form £ [n] 

(type 5). Finally, U cannot be of the form Wg U for £ 6 [q] (type 7), nor of the form 
Wg U Z' £ for i 6 \q'\ (type 9), since none of those two types of independent sets contain 
one literal for each variable. 

Hence the only remaining possibility is that U is a regular maximal independent set. 
Then U must contain a vertex te for some I £ [g], or a vertex t' e for some £ 6 [</], as 
otherwise it would not be hit by X'. Suppose, without loss of generality, that U contains 
t£. Then tg is not adjacent to any literal in U, hence it must correspond to a term of 
<p that is satisfied by the assignment. Since the reasoning holds for every assignment 
coinciding with fi on x, it shows that <p holds true, as needed. □ 
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